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Bloom syndrome, an autosomal recessive disorder of the BLMgene, confers predisposition to a broad spectrum of early- 
onset cancers in multiple tissue types. Loss of genomic integrity is a primary hallmark of such human malignancies, but 
many studies using disease-affected specimens are limited in that they are retrospective and devoid of an appropriate 
experimental control. To overcome this, we devised an experimental system to recapitulate the early molecular events in 
genetically engineered mouse embryonic stem cells, in which cells undergoing loss of heterozygosity [LOH) can be 
enriched after inducible down-regulation of Blm expression, with or without site-directed DNA double-strand break (DSB) 
induction. Transient loss of BLM increased the rate of LOH, whose breakpoints were distributed along the chromosome. 
Combined with site-directed DSB induction, loss of BLM synergistically increased the rate of LOH and concentrated the 
breakpoints around the targeted chromosomal region. We characterized the LOH events using specifically tailored ge- 
nomic tools, such as high-resolution array comparative genomic hybridization and high-density single nucleotide poly- 
morphism genotyping, revealing that the combination of BLM suppression and DSB induction enhanced genomic 
rearrangements, including deletions and insertions, whose breakpoints were clustered in genomic inverted repeats and 
associated with junctional microhomologies. Our experimental approach successfully uncovered the detailed molecular 
mechanisms of as-yet-uncharacterized loss of heterozygosities and reveals the significant contribution of microhomology- 
mediated genomic rearrangements, which could be widely applicable to the early steps of cancer formation in general. 



[Supplemental material is available for this article.] 

Genomic instability, which has been implicated as a major un- 
derlying mechanism of human carcinogenesis, is considered to be 
driven largely by dysfunction of the so-called genome "caretaker" 
genes (Kinzler and Vogelstein 1997). Analogous to Darwinian 
evolution, carcinogenesis involves the step-by-step accumulation 
of genomic alterations at multiple loci and subsequent natural 
selection of the affected cells in the somatic environment. The 
initial steps of the genomic alterations are, therefore, difficult to 
assess retrospectively in already-diagnosed cancer samples, and 
how caretaker gene dysfunction contributes to each step remains 
unknown. 

Germline deleterious mutations in the Bloom syndrome gene 
{BLM), one of the genome caretaker genes belonging to the RecQ 
family of DNA helicases, cause the rare autosomal recessive dis- 
order Bloom syndrome. This syndrome predisposes affected in- 
dividuals to a wide variety of malignancies (German 1997) that are 
associated with increased somatic loss of heterozygosity (LOH), 
including mitotic crossover (CO), noncrossover (NCO), and re- 
gional chromosome deletion (DEL) (Luo et al. 2000). The wide 
variety of Bloom syndrome-associated cancers implies that the 
initial steps of carcinogenesis in the BLM-deficient background are 
common to many cell types. The cancer predisposition phenotype 
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with increased rates of somatic LOH has been recapitulated in the 
5/m-mutant mouse model (Luo et al. 2000). We have previously 
generated a Blm^^^^^^^ mouse embryonic stem (ES) cell line in which 
the expression of Blm can be transiently switched off using the 
doxycycline-inducible (Dox-inducible) system (Fig. lA; Yusa et al. 
2004; Horie et al. 2011), allowing detailed investigation of the 
immediate impact of Blm deficiency on genomic integrity. The 
genome of mouse ES cells is relatively stable, and the spontaneous 
mutation rate is significantly lower than that in other somatic 
cells, such as mouse embryonic fibroblasts (Cervantes et al. 2002). 
Accordingly, mouse ES cells offer a powerful experimental platform 
in which de novo mutations and rearrangements are detectable 
efficiently. In addition, the Fl -hybrid nature of the Blm^'^^^'^ ES cell 
line, which is derived from crosses between two divergent inbred 
mouse strains, C57BL/6 (B6) and 129S4/SvJae (129) (Eggan et al. 
2001), together with the recent development of genome-wide 
high-density single nucleotide polymorphism (SNP) markers pro- 
vided by next-generation sequencing (Keane et al. 2011; Yalcin 
et al. 2011), facilitates high-resolution mapping of LOH break- 
points and definition of the allelic origin of each chromosomal 
segment. 

A previous study demonstrated that site-directed DNA dou- 
ble-strand break (DSB) induction enhanced the rate of CO-medi- 
ated LOH in B/m-deficient {Blm^^^^^^^ ES cells (Larocque et al. 201 1), 
suggesting that the DSB is a component of the mechanism of so- 
matic LOH in Bloom syndrome patients. To provide a compre- 
hensive view of the early processes of LOH in the absence of BLM, 
here, we present a molecular characterization of the genomic 
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Figure 1. Experimental design and synergistic enhancement in the rate of LOH by the combination of transient loss of BIm and DSB induction. (A) 
Schematic representation of the Dox-inducible knockdown construct for the Blm^^^^^^^ ES cell line (Horie et al. 2011). A Tet-off cassette, consisting 
of tetracycline-regulated transactivator (tTA) and a basal promoter containing Tet operator sequences (TetO), was inserted into both alleles of the BIm gene in 
mouse Fl (B6 x 1 29) ES cells. (Left) In the absence of Dox, the tTA protein binds to the tetO sequence and activates BIm gene expression. (Right) The 
application of Dox prevents the binding of tTA to tetO and represses BIm expression in a reversible manner. (B) Targeting strategy. A wild-type neomycin 
resistance (neo) gene cassette was inserted into exon 3 of the Aprt gene on the B6-derived chromosome 8 of Blm^^^^'^^'^ ES cells, accompanied by 
the introduction of an l-Scel recognition sequence into the Swal site 2.3 kb centromericto exon 3, resulting in Blm^^^^^'^Aprt"^°^^ ES cells. (DT) Diphtheria toxin 
A fragment, (WT) wild type. (C) A negative selection scheme for LOH using 2-fluoroadenine (2-FA). In theory, five LOH patterns — crossover (CO), non- 
crossover (NCO), chromosomal duplication (DUP), chromosomal loss (LOSS), and partial chromosomal deletion (DEL) — could be recovered. (D) Ex- 
pression vectors used in this study. 5'PB and 3'PB are terminal repeats of the piggyBac transposon. (EGFP) Enhanced green fluorescent protein, (IScelo) 
codon-optimized l-Scel, (IScelo-Geminin) a fusion gene of IScelo and geminin, (CAG) CAG promoter, (IRES) internal ribosomal entry site, (bsd) blasticidin 
resistance gene, (polyA) polyadenylation signal. (£) Schematic representation of the cell culture protocol. See text for details. (0 Total number of 2-FA- 
resistant colonies generated from 1x10^ Blm^^^^^^^Aprt"^°^^ ES cells subjected to different treatments as indicated: Dox(+) or Dox(-), 5-d culture in 
the presence or absence of Dox to switch BIm expression "off" or "on", respectively; IScelo(+) or IScelo(-), transfection of the IScelo vector for DSB 
induction or the control vector, respectively; Geminin(+), transfection of the Geminin vector. The error bars represent the standard deviation of four to 
seven independent experiments. (*) P< 0.05, Student's f-test. 
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structural changes after transient loss of BLM and/or DSB induction, 
based on a combination of high-density polymorphism typing and 
high-resolution array comparative genomic hybridization (array- 
CGH) analysis. 

Results 

Transient loss of BLM increases the rate of LOH witli 
synergistic enhancement by site-directed DSB induction 

By engineering Blm^'^^^'^ ES cells (Fig. lA; Horie et al. 2011), we 
devised an experimental system to recapitulate the early molecular 
events affecting genomic integrity in proliferating cells in vitro. We 
focused on the genomic region encompassing the adenine phos- 
phoribosyl transferase (Aprt) gene on mouse distal chromosome 8, 
because the Aprt gene itself could serve as a negative selection 
marker for LOH, as described below. The B6-derived allele of the 
Aprt gene in Blm^^^^^^^ ES cells was disrupted by targeted insertion of 
a wild-type neo-containing vector into the third exon (Fig. IB; 
Supplemental Fig. SI A). The purpose of this targeting vector was 
twofold: negative selection for LOH and site-directed DSB in- 
duction. First, disruption of the endogenous Aprt gene provides 
a negative selection system for LOH events. The Blm^'^''^'^Aprf'^^^ ES 
cells, in which the B6-derived copy of Aprt is disrupted, are still 
sensitive to 2-fluoroadenine (2-FA), because products from the 
other copy of Aprt convert 2-FA into cytotoxic metabolites (Hong 
et al. 2006). However, LOH events involving the Aprt gene locus, 
which could include any of CO, NCO, whole chromosome dupli- 
cation (DUP), whole chromosome loss (LOSS), or DEL (Fig. IC), 
could result in disruption of both alleles of Aprt and consequent 
resistance to 2-FA. Second, the targeting vector allows site-directed 
DSB induction by introducing an I-Scel recognition sequence 
centromeric to the Aprt gene on chromosome 8 (Fig. IB). To deliver 
the yeast-derived I-Scel endonuclease to the I-Scel recognition 
sequence in vivo in Blm^^^^^^^Aprf^^^^^ mouse ES cells, a piggyBac 
(PB) transposon-based expression vector of a mammalian codon- 
optimized version of the I-Scel gene (IScelo) was constructed (re- 
ferred to as pPB-CAG-IScelo-IB) (Fig. ID; Supplemental Fig. SIB). 
The codon optimization resulted in increased efficiency of cleav- 
age at the I-Scel restriction site in mammalian cells (Supplemental 
Fig. SIC). The efficiency of cleavage was further evaluated by loss of 
the I-Scel site caused by imprecise nonhomologous end- joining in 
5/^tet/tet4p^fneoA £5 ^^11^^ indicating -40% cleavage at the I-Scel 
site in vivo (Supplemental Fig. SID). 

Using this Blm^^^^^^^Aprt''^''^^ ES cell system, we examined the 
rate of LOH arising at the Aprt gene locus in 5/m-deficient and Blm- 
proficient ES cells, with or without site-specific DSB induction. The 
^^^tet/tetj^p^^neo/+ ^^^^^ ^^^^ cultured for 5 d in the absence or 

presence of Dox to continue, or transiently switch off, Blm ex- 
pression, respectively (Fig. IE). On day 1 of the culture, the cells 
were transfected with the IScelo expression vector or a control 
vector and later, from day 3 to day 7, subjected to Blasticidin (bsd) 
selection to enrich for transfected cells. On day 8 of the culture, an 
equal number (1 X 10^) of bsd-resistant cells were plated into a 
10-cm dish and subjected to 2-FA selection to compare the number 
of resistant colonies. The number of Dox(+)ISceIo(+) and Dox(+) 
ISceIo(-) colonies was significantly higher than that of Dox(-)ISceIo(+) 
and Dox(-)ISceIo(-) colonies, respectively, demonstrating that 
transient loss of BLM elevated the rate of LOH both in DSB-induced 
and uninduced conditions (Fig. IF). The IScelo-mediated DSB 
induction did not significantly increase the rate of LOH in 
5Zm-proficient cells [Fig. IF, Dox(-)ISceIo(+) vs. Dox(-)ISceIo(-)]. 



However, under transient loss of BLM, DSB induction resulted 
in synergistic enhancement of the rate of LOH [Fig. IF, Dox(+) 
ISceIo(+) vs. Dox(+)ISceIo(-)], which is consistent with a previous 
study (Larocque et al. 2011). 

The occurrence of somatic LOH could be dependent on the 
cell-cycle phase. Thus, we constructed an S/G2/M phase-specific 
IScelo expression vector by fusing the IScelo gene with the gene 
encoding 110 amino acid N terminus of human Geminin (Sakaue- 
Sawano et al. 2008), resulting in pPB-CAG-IScelo-Geminin-IB 
(Geminin) (Fig. ID). Although the Geminin vector did not exert 
the expected cell-cycle specificity (for undefined reasons) (Sup- 
plemental Fig. S2A), transfection of Dox-tvesitedBlm^'^^^'^ Apr f'""^^ ES 
cells with this vector produced a similar number of 2-FA-resistant 
colonies, as did the IScelo vector (Fig. IF). 

LOH breakpoints are uniformly distributed throughout 
the chromosome 

We next performed SNP analyses of LOH arising at the Aprt gene 
locus in 5/m-deficient or 5/m-proficient ES cells. LOH profiling of 
the isolated 2-FA-resistant clones with a panel of 12 SNP markers 
on chromosome 8 revealed that, without DSB induction, transient 
loss of BLM resulted in a fairly even distribution of LOH break- 
points (Fig. 2A; Supplemental Table SI). Such a chromosome- 
wide distribution was neither chromosome-specific nor selec- 
tion method-specific, because we observed a similar distribution 
pattern in an additional two ^/m^^'^^^^-derived mutant ES cell lines, 
Blm'''^'''Fasr'*^^ and Blm'''^'''Fasr''*^^NanogP''''''^^^\ in which a 
positive selection marker for LOH was inserted at the Fasl locus 
on distal chromosome 1 (Supplemental Fig. S3A,B; Supplemental 
Table SI) and a negative selection marker for LOH was inserted at 
the Nanog locus on distal chromosome 6 (Supplemental Fig. S3C-E; 
Supplemental Table SI), respectively. This pattern was also observed 
in B/m-proficient conditions (data not shown), suggesting that the 
chromosomal distribution of LOH breakpoints is 5/m-independent. 

Site-directed DSB induction concentrates LOH breakpoints 
on a focused region of the genome in ^/m-deficient ES cells 

The combination of transient loss of BLM with site-directed DSB 
induction dramatically altered the distribution of LOH break- 
points, such that they were more localized to the vicinity of the 
Aprt gene locus (Fig. 2B, left). Out of 64 resistant clones shown in 
Figure 2B, 55 clones underwent LOH with retention of the B6 allele 
(denoted hereafter as LOH^^) at the SNP12 locus, telomeric to the 
Aprt locus (Fig. 2B,C, Group I), while eight clones displayed no 
LOH patterns and one clone (clone 68) displayed LOH with re- 
tention of the 129 allele (denoted hereafter as LOH^^'^) at the 
SNP12 locus (Fig. 2B,C, Group II). Twenty-seven (49%) of the 55 
clones in group I retained B6/129 heterozygosity at the SNPll 
locus, representing the predominant occurrence of LOH near the 
Aprt locus compared to the LOH pattern in IScelo-nontreated cells 
(P < 0.01, test) (Fig. 2A-C). To determine the boundaries of LOH 
with higher resolution, we analyzed the genomic DNA of these 27 
clones with an additional five SNP markers between the SNPl 1 and 
SNP 12 loci (Supplemental Table SI). Among the 27 clones, 13 
clones underwent LOH^^ at the SNP u-50k locus, which is 50 kb 
centromeric to the Aprt locus (Fig. 2B,C; Supplemental Table SI). 
Meanwhile, the other 14 clones retained B6/129 heterozygosity at 
the SNP u-50k locus, indicating that their LOH breakpoints were 
between u-50k and Aprt (Fig. 2D, Group I). Nine clones in group II 
were likewise analyzed by the same SNP panel and demonstrated 
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Figure 2. Polymorphic marker analyses of chromosome 8 in Blm^^^^^^^Aprt"^"^^ ES cell clones selected by 2-FA for LOH at the Aprt locus. (A) (Right) 
Twenty-nine independent Dox(+)IScelo(-) ES cell clones resistant to 2-FA (numbered as indicated at the fop) were genotyped for 1 2 SNPs (SNPl to SNPl 2; 
SNP4 is not shown because of the low specificity of the PCR reaction) covering chromosome 8 (Supplemental Table SI). Open squares indicate the 
heterozygous genotype (B6/129). Closed black and red squares indicate LOH with retention of the B6 and 129 alleles, respectively. (BA) Control 2-FA- 
unselected Dox(-)IScelo(-) Blm^^^^^^^Aprf^°^^ ES cells. {Left) Histogram representation of LOH ratio (total number of clones with LOH/total number of 
clones examined x 100 [%]) at each marker locus. (S) (Right) Sixty-four independent Dox(+)IScelo(+) ES cell clones resistant to 2-FA (numbered as 
indicated at the fop) were genotyped for SNPl to SNPl 2. The 64 clones are divided into two groups: Group I, 55 clones carrying LOH with retention of the 
B6 allele (LOH^^) at the SNPl 2 locus; Group II, the other nine clones carrying no LOH or LOH^^^ at the SNPl 2 locus. (Left) Histogram representation of 
LOH ratio (%) at each marker locus of Group I clones. (C) Flow diagram of the SNP analysis performed in this study. The number of independent clones 
analyzed is shown in parentheses. (D) Detailed SNP analysis of the region between SNPl 1 and SNPl 2 in the clones shown in B. An additional five SNPs (u-50k, 
u-IOk, u-5k, d-6k, and d-50k) were genotyped (Supplemental Table SI), (f) Ten independent Dox(-)IScelo(+) ES cell clones resistant to 2-FA (numbered 
and categorized into Group Ic or lie as indicated) were genotyped for SNPl to SNPl 2. In the following experiments, clones indicated by asterisks at the 
bottom (D and £) carry DELs involving this region. Clone 11-73 (dagger) carries a missense mutation at exon 1 of the 1 29-derived Aprt gene. Clone llc-46 
(double dagger) retains heterozygosity at the Aprt locus, and the mutation in the 1 29 allele is undefined. 
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complex patterns of LOH within the 50 kb surrounding the Aprt 
locus (Fig. 2D, Group II). 

In addition, Geminin vector-mediated DSB induction also 
resulted in the isolation of a variety of 2-FA-resistant Blm^''^^^''^Aprf^'^^^ 
ES cell clones that had undergone LOH (Fig. 2C). All eleven in- 
dependent clones isolated displayed LOH^^ at the SNP12 locus. 
Among them, four clones (36%) located the LOH breakpoints be- 
tween the SNPll and SNP12 loci: two clones within 5 kb and one 
clone within 10 kb from the Aprt locus (Supplemental Fig. S2B). 

High-resolution array-CGH with high-density SNP analysis 
allows discrimination of DEL-associated LOH from other LOH 
patterns 

The distinct clustering of LOH breakpoints in the vicinity of the 
Aprt gene locus, promoted by I-Scel-mediated DSB induction, 
allowed for in-depth analyses of LOH. We designed a custom high- 
resolution array-CGH platform covering the 8-Mb region from the 
telomere of mouse chromosome 8 with an average resolution of 
300 bp, which encompasses the Aprt gene locus in the middle. 
With the custom array-CGH, we analyzed 30 independent clones 
with complex LOH patterns: 14 Group-I clones heterozygous for 
SNP u-50k, nine Group-II clones, three Geminin-transfected clones 
heterozygous for SNP u-50k, and four clones heterozygous for 
SNPll as controls (Fig. 2C). The combination of array-CGH and SNP 
data allowed clear discrimination of DEL-associated LOH from the 
other LOH patterns (Fig. IC) and prompted us to group the exam- 
ined ES cell clones into three categories: clones with no detectable 
DEL (non-DEL), clones with DEL on the 129 allele (129-DEL), and 
clones with DEL on the B6 allele (B6-DEL) (Fig. 3; Supplemental 
Fig. S4). The "non-DEU' group contained 21 clones: 17 with IScelo 
transfection (1-14, -16, -17, -24, -40, -46, -48, -58, -59, -63, -77, 11-15, 
-18, -68, -73, G-10, and -11) and four controls without IScelo 
transfection (8, 17, 24, and 30) (Fig. 2A,B). The other seven clones 
(clones 1-47, -49, -57, II-2, -56, -69, and -74) carried a variety of large 
DLLs on the 129 allele, ranging from 9.5 kb to 352 kb, and were 
categorized into the "129-DEL" group (Figs. 2D, 3A,B; Supplemental 
Table S2). The remaining two clones, 11-13 (Fig. 2D) and G-4 (Sup- 
plemental Fig. S2B), were categorized into the ''B6-DEL" group, in 
which DLLs on the targeted B6 allele were accompanied by complex 
insertions at the breakpoints (Fig. 3C; Supplemental Table S2). 
Overall, given that the DLLs were detected in the clones with DSB 
induction (9/26), but not in the controls with no DSB induction 
(0/4), it is likely that the DLLs occurred during the process of DSB 
repair in 5/m-deficient ES cells. 

5/m-deficient ES cells with LOH but no detectable DELs undergo 
mitotic interhomolog recombination resulting 
in CO frequently associated with gene conversion 

Among the above 17 non-DEL clones with IScelo transfection, 
only clone 73 displayed no CO points within the region (Fig. 2D). 
PGR amplification and direct sequencing revealed that this clone 
was a compound heterozygote {Aprf^^^^~) at the Aprt locus, with 
a neo insertion on the B6 chromosome and a spontaneous mis- 
sense mutation on the 129 chromosome, which together con- 
ferred 2-FA resistance (data not shown). We then further analyzed 
the remaining 16 non-DEL clones with an additional nine SNP 
markers covering the 5-kb region between SNP u-5k and Aprt 
(Supplemental Fig. S5; Supplemental Table SI), demonstrating that 
at least one mitotic CO event occurred within this proximal 
flanking region of A/7rt (Supplemental Fig. S5, arrows). Intriguingly, 



the knocked-in I-Scel sites often clones (1-24, -40, -46, -48, -58, -59, 
11-15, -18, -68, and G-11) completely recovered the 129-derived 
wild-type genomic sequence after DSB induction (Supplemental 
Fig. S5), while those of the other six clones (1-14, -16,-17, -63, -77, 
and G-10) remained either intact (1-14, -16, -63, and G-10) or 
slightly modified (1-17 and -77) on the B6 allele (data not shown). 
The recovery of the I-Scel allele to the wild type can be explained by 
interhomolog NCO recombination in 5/m-deficient ES cells, as 
explained in a previous paper (Larocque et al. 2011), and relatively 
short extents of gene conversion were observed (Supplemental Fig. 
S5, brackets). Similar interhomolog NCO recombination events 
were also detected in two other clones (1-47 and 11-56) belonging 
to the 129-DEL group, whose rearrangement patterns are further 
illustrated in the following section (Fig. 3A,B). 

Microhomology is present at the vast majority of genomic 
rearrangement breakpoints in 5/m-deficient ES cells 

Custom high-resolution array-CGH identified large DELs in seven 
129-DEL clones and two B6-DEL clones. Among the seven 129- 
DEL clones, three clones (1-47, -49, and -57) carried DELs (ranging 
from 9.5 kb to 13 kb) at the flanking centromeric region of the Aprt 
locus on the 129 chromosome, associated with interhomolog CO 
of the targeted B6 chromosome distal to the DEL interval (Fig. 3A). 
In clone 1-47, two 129-derived genomic DNA fragments were 
inserted into the DEL interval (Fig. 3 A; Supplemental Fig. S6). The 
other four clones (II-2, -56, -69, and -74) carried DELs (ranging 
from 18 kb to 352 kb) encompassing the Aprt locus on the 129 
chromosome with NCO (Fig. 3B). Notably, in clone 11-56, a B6- 
derived short DNA fragment (<1.6 kb) flanked the centromeric 
breakpoint of the DEL on the 129-derived chromosome (Fig. 3B; 
Supplemental Fig. S6). A paucity of informative SNPs usually 
hampers detection of short interhomolog gene conversion tracts, 
leading to underestimation of the frequency of such DEL-associ- 
ated gene conversions. The two B6-DEL clones (11-13 and G-4) 
carried 15-kb and 10-kb DELs on the B6 chromosomes, associated 
with interhomolog NCO and CO recombination of the Aprt-tar- 
geted B6 chromosomes, respectively (Fig. 3C). PGR cloning and 
sequencing of the DEL breakpoints revealed that a 432-bp DNA 
fragment from the neighboring Gains gene locus on the 129 chro- 
mosome was inserted in the inverted orientation into the DEL in- 
terval on the B6 chromosome in clone 11-13, whereas a 132-bp 
mosaic DNA fragment with a seamless fusion of the 129- and B6- 
alleles of the Aprt-Galns intergenic region was inserted in the 
inverted orientation into the DEL interval on the B6 chromosome 
in clone G-4 (Fig. 3C; Supplemental Fig. S6). The presence of these 
breakpoint insertions is consistent with DNA copy number 
changes of the corresponding regions identified by array-CGH 
(Supplemental Fig. S4, red arrows). 

At least nine large DELs and 13 rearrangement breakpoints 
were identified in the 26 independent Blm^^^^^^Aprf^''^-' ES cell 
clones that had been isolated by 2-FA selection after transient 
suppression of BLM and site-directed DSB induction. Remarkably, 
nine out of the 13 breakpoints (9/13, 69%) exhibited short junc- 
tional microhomologies of 2-7 bp that are shared with the proxi- 
mal and distal reference sequences (Fig. 3; Supplemental Fig. S6). 
Moreover, although no microhomology was observed at the DEL 
breakpoints in clones 1-49 and 11-13 (the telomeric breakpoint), the 
other two breakpoints in clones 1-57 and 11-74 contained short 
insertion sequences of 3 and 8 bp, respectively, that appeared to 
have been templated from sequences near the breakpoints and 
again showed junctional microhomology between the original 
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Schematic diagrams of DEL-associated LOH identified at the Aprt locus in Blm^^^^^^^Aprf^°^^ ES cell clones that were subjected to transient BLM 
suppression and DSB induction. High-resolution array-CGH and a high-density SNP panel (Supplemental Table SI), followed by direct sequencing of the 
rearrangement breakpoints, were used. (Thick blue bar) B6-derived chromosome 8. (Thick red bar) 1 29-derived chromosome 8. (Black rectangle) neo- 
resistance gene cassette. (White rectangles) Exons of the Aprt gene. Blue (B6) and red (129) arrows represent a magnified view of the breakpoint 
boundaries. The direction of the arrows corresponds to chromosomal orientation from centromere to telomere in the mouse reference genome sequence 
(NCBI Build 37/mm9). Yellow segments represent genomic repetitive sequences such as a short interspersed nuclear element (SINE) or long terminal 
repeat (LTR) retrotransposon. Light green segments represent part of the l-Scel recognition site. Microhomology regions, shared by the two ends of the 
junctions, are indicated by double-stranded DNA sequences, while short DNA insertions are indicated by ssDNA sequences. {A,B) Clones carrying DEL on 
the 1 29 allele (1 29-DEL) associated with interhomolog crossover (CO) of the distal B6 chromosome {A) or noncrossover (NCO) (6). (C) Clones carrying 
DEL on the B6 allele (B6-DEL). The lengths of the genomic segments are depicted arbitrarily. Short 1 29-derived (red) segments embedded within the B6- 
derived (blue) chromosomes {A, 1-47; 5, 11-56) represent interhomolog NCO events that copy the wild-type segment to the l-Scel DSB site, probably by 
gene conversion. 
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and the inserted sequence ends. Similar events have been observed 
in human and Drosophila melanogaster germline genome rear- 
rangements (Yu and McVey 2010; Liu et al. 201 1, 2012), and this is 
described in more detail in the following section. The vast majority 
(11/13, 85%) of the genomic rearrangement breakpoints arising 
under transient loss of BLM displayed microhomology. 

In contrast to the 5/m-deficient condition, a relatively small 
number of 2-FA-resistant colonies from Dox(-)ISceIo(+) cell pop- 
ulations hampered the collective analysis of LOH breakpoints in 
5/m-proficient ES cells as controls (Fig. 2E). Indeed, we isolated 
only 10 independent clones (21%) from 47 5/m-proficient cell 
populations (Supplemental Fig. S7A), while isolating 43 indepen- 
dent clones (91%) from as many 5/m-deficient cell populations (data 
not shown). Moreover, among the ten isolated clones (Fig. 2E), only 
one (clone Ic-34) carried a genomic rearrangement flanking the 
Aprt locus (Supplemental Fig. S7B). PGR cloning and sequencing 



detected no breakpoint microhomology around the DEL break- 
point (Supplemental Fig. S7C; Supplemental Table S2). Thus, taken 
together, these results suggested that transient loss of BLM en- 
hances microhomology-mediated genomic rearrangement, con- 
tributing, at least in part, to the high rate of somatic LOH. 

Some short insertions at DEL breakpoints in 5/m-deficient ES 
cells can be accounted for by the loop-out and /or snap-back 
models 

As shown above, our sequence analysis identified 3- or 8-nt in- 
sertions at the DEL breakpoints in clones 1-57 and 11-74, respectively 
(Fig. 3A,B; Supplemental Fig. S6). Interestingly, these insertions 
formed at least part of short direct or inverted repeats that appeared 
on both sides of, or partially overlapped with, the DEL breakpoints. 
We propose a stepwise mechanism for these rearrangements (Fig. 4). 
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Figure 4. Templated insertions at DEL breakpoints. (A) Sequence of the DEL breakpoint in clone 1-57 (Fig. 3A). A net insertion of a 3-bp DNA stretch (5'- 
AAT-3', black bold typeface) was found between the 1 29-derived (red) 5' breakpoint (chr8: 1 25087464) and the B6-derived (blue) 3' breakpoint (l-Scel site 
inserted at chrS: 1 25097008) of the DEL interval. Part of the l-Scel recognition site is highlighted in light green. Pink and purple arrows indicate direct and 
inverted repeats, respectively. (B-f) Multi-step model based on the loop-out and snap-back models proposed in synthesis-dependent microhomology- 
mediated end joining (Yu and McVey 201 0). (B) The l-Scel site on the B6 allele is cleaved by IScelo transfection. (C) The 5' end of the DSB site is resected to 
create a 3' ssDNA tail (bottom strand). (D) The 3' ssDNA tail end (5'-TAT-3', boxed) seeks out its annealing partner on the residual top strand (5'-ATA-3') by 
forming a transient looping-out structure and (£) primes strand extension along the template strand, resulting in (F) the formation of short direct repeats (5'- 
TATATTAC-3', pink arrows). (C) The extended 3' tail end (bottom) is then "snapped back" to form a hairpin structure for annealing of the two-base 
oligonucleotides (5'-AC-3', boxed) to the complementary sequence (5'-GT-3') on the same strand. (H) A template-dependent DNA polymerase extends the 
3' end, removing some nucleotides on the fop strand. (/) The hairpin structure is unwound, and the short inverted repeats (5'-ACCCTGT-3', purple arrows) 
are formed. The newly synthesized 3' end is then annealed to ssDNA 9.5 kb centromeric to the DSB site via the microhomology (5'-ACAGGGT-3', blue bold 
typeface). Interestingly, the annealing partner resides within a genomic inverted repeat (highlighted in yellow) as shown in Figure 5. (/) Sequence of the DEL 
breakpoint in clone 11-74 (Fig. 3B). A net insertion of an 8-bp DNA stretch (5'-CAGGCACT-3', black bold typeface) is found between the 1 29-derived 5' 
breakpoint (chr8: 1 2490541 8) and 3' breakpoint (chr8: 1 25257681 ) of the DEL interval. (K-0) Multistep model: (K) A one-ended DSB is produced 1 60 kb 
telomeric to the Aprt\ocus by an undefined mechanism. (L) The 5' end of the DSB site is resected to create a 3' ssDNAtail (bottom strand). (M)Jhe 3' tail end is 
snapped back to form a hairpin structure via one-base microhomology (boxed), and (N) primes strand extension along the template strand. (O) The hairpin 
structure is unwound, and the short inverted repeats (5'-CAGTGCCTGCC-3', purple arrow) are formed. The newly synthesized 3' end is then annealed to 
ssDNA within a genomic repeat (highlighted in yellow), 1 92 kb centromeric to the Aprt locus via microhomology (5'-CC-3', boxed). 
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The complicated junctional sequences can be adequately accounted 
for by the "loop-out" and/or "snap-back" mechanisms described in 
a recently proposed DSB repair model in Drosophila melanogaster, 
termed synthesis-dependent microhomology-mediated end joining 
(Yu and McVey 2010). In clone 1-57, for example, the induction of 
I-Scel endonuclease in Dox-treated Blm^^^^^^^Aprf^"^^ ES cells cleaves 
the knocked-in I-Scel recognition site centromeric to Aprt on the B6 
allele, yielding DSBs of which the 5' ends are resected and 3' single- 
stranded DNA (ssDNA) tails are formed (Fig. 4A-C). The exogenous 
DNA sequence of each DSB end does not have its counterpart to 
which to anneal on the homologous chromosome; therefore, nor- 
mal interhomolog DSB repair by homologous recombination may 
be unfavorable. Instead, the 3' ssDNA tail can seek out its annealing 
partner by taking advantage of the first few nucleotides, which 
mediate formation of transient secondary structures by "looping- 
out" or "snapping-back" motions and prime strand extension, 
resulting in the templated insertion-mediated junction formation 
(Fig. 4D-I). The snap-back model is also applicable to the junctional 
sequence observed in clone 11-74, although the cause of the initial 
DSB at the telomeric DEL breakpoint remains unclear (Fig. 4J-0). In 
either case, after scanning the flanking DNA strands in this way, the 
newly synthesized 3' end is joined via microhomology to a se- 
quence somewhat distant from the DSB. Intriguingly, the other ends 
of these DELs were both located within genomic inverted repeats, as 
shown in the next section. 

Breakpoints of DELs in 5/m-deficient ES cells are largely 
clustered in genomic inverted repeats 

We sought to dissect the mechanism of DEL formation further by 
mapping the breakpoint sequences to mouse chromosome 8. This 
revealed that the centromeric DEL breakpoints in the majority 
(6/9) of the 129-DEL or B6-DEL clones (Fig. 3) were located within 
a distinct 4.2-kb repeat-rich region between SNP markers u-15k and 
u-lOk (Fig. 5 A). A self-alignment dot plot of the 4.2-kb repeat-rich 
region generated using BLAST2 (Tatusova and Madden 1999) 
demonstrated that all the centromeric breakpoints of the DELs 
were located in intrachromosomal inverted repeats consisting of 
short interspersed nuclear elements (SINEs) or long terminal repeat 
(LTR) retrotransposons (Fig. 5 A). Likewise, the telomeric break- 
point of the DEL in clone G-4 was located within two tandemly 
inverted (i.e., perfectly palindromic) 135-bp SINE repeats (Fig. 5B). 
Thus, the majority (7/9) of the 129-DEL and B6-DEL clones, in 
which Blm expression had been transiently switched off, under- 
went DELs, and at least one of the breakpoints was located in an 
inverted repeat. 

Discussion 

We present an in-depth experimental analysis of the immediate 
effects of functional loss of BLM on genomic integrity, which are 
difficult to assess retrospectively in affected cancer cells. Transient 
suppression of Blm expression in mouse ES cells increased the rate 
of LOH (Fig. IF), the breakpoints of which were uniformly dis- 
tributed along the chromosome (Fig. 2A; Supplemental Fig. S3B,E). 
The elevated rate of LOH was further enhanced by I-Scel-mediated 
site-directed DSB induction (Fig. IF). DSB is a common event in 
any genomic region in living cells, resulting from exposure to in- 
trinsic or environmental stress; therefore, such spontaneous DSBs 
may underlie the wide range of cancers arising in Bloom syndrome 
individuals. This motivated us to examine the genomic alterations 
at higher resolution. Conveniently, site-directed DSB induction 
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Figure 5. Clustering of DEL breakpoints in genomic inverted repeats. 
{A) Centromeric DEL breakpoints (rightward bent arrows) in six out of nine 
129-DEL or B6-DEL clones (Fig. 3) are located within a distinct 4.2-kb 
repeat-rich region between SNP markers u-15k and u-IOk (Supple- 
mental Table SI). Yellow rectangles represent repetitive sequences. A 
self-alignment dot plot using BLAST2 (Tatusova and Madden 1999) of 
a DNA fragment (chr8: 125082686-125090184) containing the 4.2-kb 
repeat-rich region is shown. Lines with a slope of +1 indicate direct repeats, 
while lines with a slope of-1 indicate inverted repeats. (S)The telomeric DEL 
breakpoint (leftward bent arrow) in clone G-4 is localized within tandemly 
inverted (i.e., perfectly palindromic) 2 x 1 35-bp SINE repeats. (d-6k) SNP 
marker located 6 kb telomeric to the Aprt locus (Supplemental Table SI). 



was found to concentrate LOH events in a specific region in Blm- 
deficient ES cells (Fig. 2B), facilitating the detailed characterization 
of LOH by a combination of region-specific high-resolution array- 
CGH, high-density SNP analyses, and direct sequencing of the 
rearrangement breakpoints. 

Possible molecular mechanisms for genomic rearrangements 
induced by a combination of transient loss of BLM 
and site-directed DSB induction 

Site-directed DSB induction during temporary loss of Blm expres- 
sion resulted in varying patterns of LOH affecting the Aprt gene 
locus on distal chromosome 8 in mouse Blm^^^^^^^Aprf^^^^ ES cells. A 
plausible explanation for the molecular mechanisms underlying 
such genomic rearrangements is shown in Figure 6. In the initi- 
ating step of DSB repair, the 5' ends of the DSB are resected to yield 
3 ' ssDNA tails, which invade the homologous DNA duplex forming 
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Figure 6. Proposed molecular mechanisms for genomic rearrangements observed in the present study. Blue and red bars represent the B6-derived and 
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deficient (BIm OFF) conditions. Corresponding clone names are indicated at the bottom. (*) Clone devoid of apparent breakpoint microhomology. 



a displacement-loop (D-loop) (Fig. 6 A; Chen et al. 2007). The in- 
vading strand is extended by DNA synthesis and often displaced 
from the template strand to anneal to the other 3 ' end of the DSB, 
leading to the exclusive formation of NCO products. This pathway, 
termed synthesis-dependent strand-annealing (SDSA), is, however, 
suppressed by BIm deficiency at least in Drosophila melanogaster 
(Fig. 6B; Adams et al. 2003). Instead, the D-loop intermediate is 
converted to double Holliday junctions (double HJs) (Fig. 6C). The 
ELM helicase, which forms a complex with DNA topoisomerase 
Illalpha, acts to enhance "convergent migration," resulting in "dis- 
solution" of double HJs and generation of NCO products (Fig. 6D; Wu 
and Hickson 2003). In contrast, the absence of BLM would sup- 
press the dissolution pathway and increase the relative contribu- 
tion of an alternative "resolution" pathway of double HJs, whereby 
DNA strands at one of the double HJs are cleaved and religated 
to generate either NCO (Fig. 6E, 11-15, -18, and -68) or CO (Fig. 6F, 
1-14, -16, and -17) products depending on the orientation of the 
cleavage strands. The extent of branch migration of HJs could be 
limited by encountering an unusual secondary DNA structure gen- 
erated by genomic repeats or genomic inverted repeats (Fig. 6G). 
Then, the stalled HJ would be disrupted by an as-yet-unidentified 
endogenous endonuclease to yield an unpaired DSB end (referred 
to as a "secondary DSB" in Fig. 6H) either on the I-Scel-cleaved 
B6 chromosome (B6-DEL: clones 11-13 and G-4) or on the template 
129 chromosome (129-DEL: clones 1-49, II-2, -56, and -69). Con- 
sidering the presence of microhomology at most of the break- 
points, one can assume that these secondary DSB ends are engaged 



in microhomology-mediated genomic rearrangements. Two major 
models, replication-independent and replication-dependent, have 
been suggested for the microhomology-mediated rearrangement 
mechanism. The former includes alternative nonhomologous end 
joining (alt-NHEJ; also known as microhomology-mediated end 
joining) (McVey and Lee 2008; Simsek and Jasin 2010), while the 
latter includes fork stalling and template switching (FoSTeS) (Lee 
et al. 2007) and its derivative, microhomology-mediated break- 
induced replication (MMBIR) (Hastings et al. 2009; Zhang et al. 
2009). It is difficult to distinguish these two major models based 
only upon the observation of breakpoint sequences, and they are 
not necessarily mutually exclusive. For example, the DEL rear- 
rangements observed in clones II-2, 11-69, and 11-74 (Fig. 3B) affect 
only the 129-derived chromosomal stretches and could be ex- 
plained by alt-NHEJ. It would be the case if two unpaired DSB ends 
arise at the same time in the same cell, and this possibility is not 
ruled out by the present analysis. Another more parsimonious 
explanation, however, would be provided by the FoSTeS or MMBIR 
model (Fig. 61; Arlt et al. 2012). Importantly, previous studies have 
shown that BLM is required for efficient replication-fork restarting, 
and a defect in BLM leads to increased replication fork stalling 
(Davies et al. 2007; Rao et al. 2007; Machwe et al. 2011), which is 
a prerequisite for the FoSTeS mechanism. Accordingly, in the FoSTeS/ 
MMBIR model, the secondary DSB end is annealed via micro- 
homology to ssDNA arising elsewhere under replication fork stalling 
and is extended along the given template strand. The microhomology 
annealing occurs in a chromosomal orientation-independent 
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manner, which could account for the presence of inverted insertion 
segments at the breakpoints, as observed in clones 1-47, 11-13, and 
G-4 (Fig. 3A,C), and also in recently published works on human 
germline and somatic genome rearrangements (Conrad et al. 2010; 
Arlt et al. 2011; Mills et al. 2011). The template-switching and ex- 
tension can be repeated until the repair process is completed. 

Besides the secondary DSB end discussed above, a primary DSB 
end generated by I-Scel cleavage (Fig. 6J) could directly provide a 3' 
ssDNA tail, which seeks other ssDNA templates to which to anneal 
via microhomology and to extend along using a replication mech- 
anism instead of HJ formation (Fig. 6K, clone 1-47). The I-Scel 
breakpoint on the B6-derived chromosome does not have its 
counterpart to which to anneal on the homologous 129-derived 
chromosome; therefore, the 3' ssDNA tail could scan the flanking 
DNA strands and occasionally form transient loop-out or snap-back 
structures, which may be maintained for longer in a BLM helicase- 
inactive state (Fig. 6L, clones 1-57 and 11-74). During the formation 
of transient structures, a short stretch of DNA is synthesized along 
a template strand and makes a breakpoint insertion, the micro- 
homology end of which mediates a rearrangement with another 
ssDNA end arising elsewhere in repeats or inverted repeats (Fig. 6M). 
The occasional finding of templated insertions at breakpoints (Fig. 
4) is readily attributed to DNA polymerase activity further sup- 
porting the replication-dependent FoSTeS/MMBIR model, rather 
than replication-independent end- joining models, as a compre- 
hensive interpretation of the rearrangement mechanism in the 
5/m-deficient situation. FoSTeS/MMBIR is thought to involve the 
repair mechanism of one-ended DSBs arising from collapsed rep- 
lication forks (Liu et al. 2012), whereas, in the present experi- 
mental study, the rearrangement mechanism should involve 
two-ended DSBs generated by I-Scel cleavage, which has not been 
implicated in previously proposed break-induced replication (BIR) 
models, with the exception of one-ended DSBs induced by I-Scel 
cleavage of prokaryotic circular genomes (Shee et al. 2012). So far, 
the limited scale of the current experiment has hampered the 
complete tracing of the two DNA ends generated by a single I-Scel 
cleavage and has resulted in fate tracing of either end of the break 
(Fig. 3 A, clones 1-47 and 1-57), which does not necessarily rule out 
the possibility that the two ends are independently repaired by the 
replication-dependent mechanism. Hicks et al. recently suggested 
the presence of a novel microhomology-mediated SDSA pathway 
in yeast (Hicks et al. 2010), which could be applicable to interpret 
the potential replication-dependent repair mechanism of two- 
ended DSBs observed in mammalian genomes. 

Within the current experimental framework, there is a practi- 
cal trade-off between short-range high-resolution genomic analyses 
and wide-range low-resolution genomic analyses. In the present 
study, the negative selection scheme by 2-FA resistance allowed 
focused high-resolution analyses of LOH involving the Aprt locus 
on chromosome 8 and detailed characterization of the LOH pat- 
terns arising in almost all clones tested (Fig. 2). However, this was at 
the expense of genome-wide characterization of rearrangement 
events, including interchromosomal translocations and more 
complex alterations. Array-CGH is a reliable method for screening 
for copy-number variations, and our custom high-resolution array- 
CGH is sensitive for discerning the difference at least between one, 
two, and three copies of genomic segments on distal chromosome 
8, which was cross-validated by direct sequencing (Supplemental 
Fig. S4). Nevertheless, in principle, it cannot detect balanced 
chromosomal inversions and translocations that often occur at 
deletion breakpoints. Next-generation sequencing technology 
could contribute to the detection of such balanced rearrangements 



and more complex alterations. In fact, recent studies have shown 
that extraordinarily complex somatic rearrangements like chro- 
mothripsis display less junctional microhomology than germline 
rearrangements (Stephens et al. 2011; Malhotra et al; 2013), and 
germline balanced rearrangements display less junctional micro- 
homology than unbalanced ones (Chiang et al. 2012). Genome- 
wide rearrangement analyses using next-generation sequencing 
platforms are currently in progress for our Dox-inducible Blm- 
knockdown cell lines. 

The localization of DEL breakpoints within annotated geno- 
mic repeats (Fig. 5) can also be seen in Drosophila melanogaster Blm 
mutants (Garcia et al. 2011), suggesting the existence of a common 
mechanism for DEL formation in a 5/m-deficient background. BLM 
helicase acts to unwind secondary DNA structures at the telomere 
(Sfeir et al. 2009). The loss of BLM may therefore contribute to 
prolonged maintenance of the unusual secondary DNA structures 
that appear in regions of repetitive sequence, such as inverted re- 
peats, which may hamper normal replication fork progression and/ 
or HJ migration. 

Potential relevance of transient loss of BLM to human 
carcinogenesis 

Bloom syndrome, an autosomal recessive human disorder of 
the BLM gene, predisposes patients to a wide variety of cancers 
(German 1997). The disorder is an inherited and lifelong condition; 
therefore, the exact contribution of BLM dysfunction to the time 
course of cancer formation is difficult to discern in clinical human 
specimens. The Dox-controlled shutoff of the Blm gene expression 
in our mouse 5/m^^^'^^^^ES cell system, however, enabled us to dissect 
the immediate effects of transient loss of BLM on genomic in- 
tegrity. When combined with DSB introduction, the transient loss 
of BLM resulted in microhomology-mediated genomic rearrange- 
ments within a period as short as several days, representing a sig- 
nificant fraction of the induced LOH events. This might point to 
an early contribution of BLM dysfunction to cancer development. 
Moreover, this underscores the importance of such genomic rear- 
rangements, in addition to increased generation of homozygosity 
for tumor-suppressor gene mutation, as a mechanism of cancer 
predisposition in Bloom syndrome. The BLM gene is a typical 
housekeeping gene and is constitutively expressed in most normal 
somatic cells. However, the BLM protein is a target of the pro- 
teolytic enzyme caspase 3 and can be cleaved into a truncated form 
that has lost its capability to interact with topoisomerase Illalpha 
(Freire et al. 2001). Our in-depth genomic analysis will provide 
a valuable foundation for understanding early genomic alterations 
not only in cancers associated with Bloom syndrome, but also in 
a wide range of human cancers. 

Methods 

Vector construction 

Construction of all the plasmid vectors used in this study is de- 
scribed in the Supplemental Methods. 

Cell lines 

Mutant mouse ES cell lines, Blm'^"'^'Fasr'*^^ , Blm'^'^^'Fasr''*^^ 
Nano/"'^^"^^^/"', and BM^^'^^^Aprf''^^, were generated by targeted 
homologous recombination. Detailed procedures are described in 
the Supplemental Methods. 
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Cell culture, transfection, and polymorphic marker analysis 

^l^tet/tetj^pj.fieo/+ ^^^h were pretreated with 0.1 mM adenine and 
20 |jLg/mL alanosine-containing medium for 4 d to eliminate pre- 
existing yiprf-deficient cells (Hong et al. 2006). Subsequent pro- 
cedures for culture, transfection, and colony isolation are shown in 
Figure IE and detailed in the Supplemental Methods. To avoid 
double counting, each clone was isolated from a different culture 
dish. Genomic DNA was extracted by a standard phenol/chloro- 
form method, and SNPs were genotyped by direct sequencing of 
PGR products (Supplemental Table SI). 

Functional assay of i-Scei endonuclease activity 

The pIRES-TK-EGFP reporter plasmid (Ogiwara et al. 2011) (kindly 
provided by Dr. T. Kohno, National Cancer Center Research In- 
stitute, Tokyo, Japan) was cotransfected with either pPB-GAG-IB 
(Control), pPB-CAG-IScel-IB (IScel), or pPB-CAG-IScelo-IB (IScelo) 
into 293FT cells using Lipofectamine 2000 (Life Technologies). 
EGFP-expressing cells were analyzed by BD FACSCanto II (BD 
Biosciences) 1 d after transfection. Alternatively, Blm^^^^^^Aprf^^^^ 
ES cells were transfected with the IScelo expression vectors and 
selected by blasticidin (100 |jLg/mL) for 4 d. The resistant cells were 
harvested to extract genomic DNA, and the 3.7-kb region encom- 
passing the I-Scel cleavage site was amplified by PGR using primers 
IScel-lK-U and PGKpro-L2. The resulting PGR products were 
digested in vitro with I-Scel endonuclease (New England Biolabs) to 
detect loss of the I-Scel site, indicative of imprecise nonhomologous 
end-joining occurring after I-Scel cleavage in vivo. The sequences of 
the primers used in this assay are: IScel-lK-U, 5'-GTGAGTATGG 
AGGTGGTGTGTGGAAGGATG-3', and PGKpro-L2, 5'-CGGGGCT 
GGTAAAGGGGATGGTGGAGAGTG-3 ' . 

Analysis of vector-mediated I-Scel expression associated 
with the cell cycle 

293FT cells were transfected with HA-tagged I-Scel expression 
vectors pPB-GAG-IScelo-IB (IScelo) or pPB-GAG-IScelo-Geminin- 
IB (Geminin) using Lipofectamine 2000 (Life Technologies). After 
2 d of culture, the cells were harvested by trypsinization, fixed/ 
permeabilized with BD Gytofix/Gytoperm solution and BD Perm/ 
Wash buffer (BD Biosciences), and incubated with a monoclonal 
antibody against HA (12GA5; Roche Diagnostics). The cells were 
then incubated with an Alexa Fluor488-conjugated goat anti- 
mouse IgG secondary antibody, followed by staining with Hoechst 
33342 (both from Life Technologies). Flow cytometric analysis was 
performed using a BD FACSAria II (BD Biosciences), and the data 
were analyzed using Flowjo software (TreeStar). 

High-resolution array-CGH analysis 

High-density probes with an average resolution of 300 bp were 
designed within the 8-Mb telomeric region of chromosome 8 in 
triplicate for an Agilent 4 X 180 K GGH array (Agilent Technol- 
ogies). The array also includes a control grid (IS-180880-4-V2_ 
4byl80K_GGH_Mm_20090225), a normalization probe group 
(Mouse_CGH_3k_ Agilent Normalization Probe Group), and 
a replicate probe group (Mouse_CGH_lk_ Agilent Replicate Probe 
Group). The remaining features were filled with probes from the 
4 X 180 K mouse GGH probe group. The array design was deposited 
with EBI ArrayExpress (http://www.ebi.ac.uk/arrayexpress/) under 
the accession number A-MEXP-221 7. Genomic DNA was labeled 
with either Gy3 (reference) or Gy5 (sample), and hybridized to the 
arrays. The arrays were scanned with an Agilent microarray scanner, 
and raw data were generated by Agilent Feature Extraction software. 
GGH calls were made with Agilent DNA Analytics software using the 



ADM2 algorithm (6.0 threshold) with a minimum of three probes in 
the region as a filter. 

Self-alignment dot plot analysis 

The DNA sequence was compared with itself using the BLAST2 
program (Tatusova and Madden 1999) with default parameters 
(http://www.ncbi.nlm.nih.gov/gorf/bl2.html). 

Data access 

The array-CGH data have been deposited with EBI ArrayExpress 
(http://www.ebi.ac.uk/arrayexpress/) under accession number 
E-MTAB-1116. 
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